Employing The Complete Face in AVSR to Recover from Facial Occlusions

Authors

  • Benjamin X. Hall
  • John Shawe-Taylor
  • Alan Johnston
Abstract

Existing Audio-Visual Speech Recognition (AVSR) systems focus visually on a small region of the face centred on the immediate mouth area. This is poor design for real-world situations for a variety of reasons: any occlusion of this small area renders all visual advantage null and void, and it is well known that humans use the complete face to speechread. We demonstrate a new application of a novel visual algorithm, the Multi-Channel Gradient Model (MCGM), that deploys information from the complete face to perform AVSR. Our MCGM model performs close to Discrete Cosine Transforms (DCTs) in the case where a small region of interest around the lips is used, but in the case of an occluded face we can achieve results that match nearly 70% of the performance that DCTs achieve in their best-case, lip-centric approach.

Similar resources

Recover Canonical-View Faces in the Wild with Deep Neural Networks

Face images in the wild undergo large intra-personal variations, such as poses, illuminations, occlusions, and low resolutions, causing great challenges to face-related applications. This paper addresses this challenge by proposing a new deep learning framework that can recover the canonical view of face images. It dramatically reduces the intra-person variances, while maintaining the inter-pe...

Problems associated with current area-based visual speech feature extraction techniques

Techniques such as principal component analysis (PCA), linear discriminant analysis (LDA) and the discrete cosine transform (DCT) have all been used to good effect in face recognition. As these techniques are able to compactly represent a set of features, researchers have sought to use these methods to extract the visual speech content for audio-visual speech recognition (AVSR). In this paper, ...

Relationship between Length and Width of Maxillary Central Teeth and Facial Indices in Patients with Complete Denture

Background and purpose: In tooth selection for dentures, the size of the tooth is important. To estimate the size, a group of facial landmarks has been investigated. This study evaluated the relationship between the length and width of maxillary central incisors and measurable indices of the face in patients with complete denture. Materials and methods: A descriptive-analytic cro...

Assessment of Facial and Cranial Development in Shirvanian Kurmanj Population Based on the Mean Biometric Factors from Birth to Maturity Age

Purpose: The aim of this study was to determine cranial and facial anthropometric ratios and to assess cranial and facial development in the Shirvanian Kurmanj population. Materials and Methods: This cross-sectional analytical study was conducted randomly on 137 boys from Shirvan with normal face patterns. Facial and cranial ratios were estimated and compared. Data were analyzed by SPSS software. T...

Brown Discoloration on the Face

Case: A 54-year-old woman presented to our dermatology clinic with a history of asymptomatic gray-brown discoloration of the facial skin. The lesions first appeared on her chin and then became progressively darker and extended to her nose and, to a lesser extent, to the periphery of her face over a period of five years. She mentioned that the lesions worsened with heat and sun exposure. Her p...

Publication date: 2011